AUC Maximization for Low-Resource Named Entity Recognition
نویسندگان
چکیده
Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize underlying NER model. Both of these traditional objective for problem generally produce adequate performance when data distribution is balanced and there are sufficient annotated training examples. But since inherently an imbalanced tagging problem, model under low-resource settings could suffer using standard functions. Based on recent advances area ROC curve (AUC) maximization, we propose by maximizing AUC score. We give evidence that simply combining two binary-classifiers maximize score, significant improvement over loss achieved settings. also conduct extensive experiments demonstrate advantages our method highly-imbalanced To best knowledge, this first brings maximization setting. Furthermore, show agnostic different types embeddings, models domains. The code available at https://github.com/dngu0061/NER-AUC-2T.
منابع مشابه
Named Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملUsing Wikipedia as a Resource for Arabic Named Entity Recognition
In this paper we describe a novel approach to delimit named entities and nominal successive mentions in the Arabic Wikipedia. To achieve this a linguistic analysis of named entities and successive mentions in the Arabic Wikipedia, in terms of coverage and complexity, is presented. A supervised machine learning classifier has been used to predict the presence of the named entities in the Arabic ...
متن کاملA Golden Resource for Named Entity Recognition in Portuguese
This paper presents a collection of texts manually annotated with named entities in context, which was used for HAREM, the first evaluation contest for named entity recognizers for Portuguese. We discuss the options taken and the originality of our approach compared with previous evaluation initiatives in the area. We document the choice of categories, their quantitative weight in the overall c...
متن کاملNamed Entity Recognition for Ukrainian: A Resource-Light Approach
Named entity recognition (NER) is a subtask of information extraction (IE) which can be used further on for different purposes. In this paper, we discuss named entity recognition for Ukrainian language, which is a Slavonic language with a rich morphology. The approach we follow uses a restricted number of features. We show that it is feasible to boost performance by considering several heuristi...
متن کاملPhonologically Aware Neural Model for Named Entity Recognition in Low Resource Transfer Settings
Named Entity Recognition is a well established information extraction task with many state of the art systems existing for a variety of languages. Most systems rely on language specific resources, large annotated corpora, gazetteers and feature engineering to perform well monolingually. In this paper, we introduce an attentional neural model which only uses language universal phonological chara...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i11.26571